GeneRIF is a more comprehensive, current and computationally tractable source of gene-disease relationships than OMIM
نویسندگان
چکیده
Motivation: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Methods: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to data mine gene-disease relationships from both the GeneRIF and OMIM databases. We utilized a comprehensive subset of UMLS structured as a directed acyclic graph (the Disease Ontology) to filter and interpret results from MMTx. The data mining methodology was validated against the Homayouni gene collection using recall and precision measurements. Results: The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% (recall) and 98% (precision) using OMIM. Our thesaurusbased approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. Contact: [email protected]
منابع مشابه
The disease and gene annotations (DGA): an annotation resource for human disease
Disease and Gene Annotations database (DGA, http://dga.nubic.northwestern.edu) is a collaborative effort aiming to provide a comprehensive and integrative annotation of the human genes in disease network context by integrating computable controlled vocabulary of the Disease Ontology (DO version 3 revision 2510, which has 8043 inherited, developmental and acquired human diseases), NCBI Gene Refe...
متن کاملGenetic Diagnosis of a Lethal Form of Autosomal Recessive Polycystic Kidney Disease
Background Autosomal recessive polycystic kidney disease (ARPKD; OMIM number 263200) is a severe early onset hereditary form of polycystic kidney and liver disease. Case Report In the current study, we present a consanguineous couple with a history of an affected son with polycystic kidney disease (PKD), hepatic failure and epileptic seizures who died at the age of 8 months. Both parents were h...
متن کاملSequence analysis of the VP1 gene in three very virulent Iranian infectious bursal disease virus strains
Infectious bursal disease (IBD) is a highly contagious disease of chickens caused by the infectious bursal disease virus (IBDV). This study was conducted to characterize three IBDV strains from Iran. A reverse transcriptase-polymerase chain reaction (RT-PCR) procedure was used to amplify a 715-bp fragment of the VP1 gene from IBDV strains. Amplified VP1 fragments of the three Iranian IBDV strai...
متن کاملFinding GeneRIFs via Gene Ontology Annotations
A Gene Reference Into Function (GeneRIF) is a concise phrase describing a function of a gene in the Entrez Gene database. Applying techniques from the area of natural language processing known as automatic summarization, it is possible to link the Entrez Gene database, the Gene Ontology, and the biomedical literature. A system was implemented that automatically suggests a sentence from a PubMed...
متن کاملA Framework for Annotating Human Genome in Disease Context
Identification of gene-disease association is crucial to understanding disease mechanism. A rapid increase in biomedical literatures, led by advances of genome-scale technologies, poses challenge for manually-curated-based annotation databases to characterize gene-disease associations effectively and timely. We propose an automatic method-The Disease Ontology Annotation Framework (DOAF) to prov...
متن کامل